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Abstract 



In this paper we present estimates of roommate and institution based peer effects. 
Using data from the College & Beyond survey, the Freshman survey, and phonebook 
data that allows us to identify college roommates - we estimate models of students’ 
political persuasion and intellectual engagement. The evidence suggests that a student’s 
roommate’s political sentiments have some impact on their own political views later in 
life. We also implement a cluster based analysis that attempts to answer the question: 
how would a student’s outcomes have changed if they’d attended a very different school? 
Our findings suggest that student outcomes are, indeed, sensitive to the school they 
attend. Similar students attending schools that have a decidedly different “ethos” differ 
in important ways post-college. Institutional peer effects seem to have a powerful effect 
on student outcomes. 
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I. Introduction 



The existence and nature of peer effects are fundamental to understanding an 
array of crucial issues central to the economics of education. A variety of policies 
affecting both higher and lower education entail a redistribution of students. Affirmative 
action, school choice, or busing, for example, all redistribute students across schools. 
Ability tracking redistributes students within a school. Such policies serve to alter the 
peer attributes of a school or a classroom. These changes, in turn, may impact individual 
students. If students are sensitive to the peer environment of the school or the classroom, 
then such redistributions can affect both individual and aggregate outcomes. 

Peer effects play an additional role in higher education. In a series of papers, 
Gordon Winston (c.f. Winston, 1997) demonstrates that a central feature of American 
higher education is that students routinely pay a net price significantly below the cost of 
their education 1 . The resulting subsidies are both ubiquitous and sizeable (c.f. Winston, 
1999). Winston suggests that such subsidies serve, in part, to enhance the pool of 
applicants to a school, attracting students who embody attributes an institution deems 
valuable. Student quality, in this model, is seen as an important input in the production 
of education. Schools try to attract high quality students, in part, because it is believed 
they confer positive educational externalities on other students. The model assumes that 
students “advance farther” academically when in the presence of academically strong 
students - just as elite athletes or musicians, for example, “advance farther” in the 
company of other strong athletes or musicians. 



1 In a for-profit world, this would mean losing money on every unit produced. 
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Estimating peer effects is difficult. First, we must decide on the appropriate set of 
educational outcomes believed to be sensitive to peer attributes. Second, we must specify 
the relevant peer attributes. Third, and perhaps most difficult, we must contend with the 
fact that selection bias is rampant in the estimation of peer effects. In a laboratory 
setting, we might randomly assign a set of subjects to different peer environments and 
gauge any resulting effects. In the world of non-experimentally derived observations, 
however, we must recognize that people choose their peers. And, if people tend to 
associate with others with similar traits (many of which are likely to be unobservable to 
the researcher), then it is exceedingly difficult to determine whether we are observing 
peer effects or simply observing similar people behaving similarly. 

Empirical estimates of peer effects go back at least as far as the prominent 
Coleman Report of 1966. The Coleman Report not only concluded that peer effects 
existed and were significant in shaping educational attainment - with students being 
seriously advantaged or disadvantaged depending on the quality of their fellow- 
classmates - but it asserted, too, that those effects were non-linear - that the weak student 
benefited more from association with strong classmates than those strong students lost in 
associating with weaker classmates in studies of primary and secondary education . 

Subsequent work on peer effects in higher education has focused on the selection 
issue and has attempted to implement quasi-experimental strategies wherein students are, 
in effect, randomly assigned environments of differing peer quality. One fruitful line of 
work has utilized the random assignment of first year college roommates. Such a 
strategy allows the comparison of high, medium, and low ability students (typically 

2 For an interesting strategy to estimate peers in an experimental context, see Goethals (2001). 

3 (Coleman, Campbell et al. 1966). 
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measured by SAT scores) with high, medium, or low ability roommates. For example, 
weak students roomed with weak students might be compared to weak students roomed 
with strong students. Differences in, say, the students’ grade point average might then 
credibly be attributed to the different peer environments (i.e. the strength or weakness of 
their roommate). Studies of this sort were initially done by researchers at Dartmouth 
College and Williams College (see Sacerdote, 2001 and Zimmerman, 2002). These 
papers found evidence both for the existence and the non-linearity of peer effects (though 
the evidence on any non-linearities is less consistent). Another paper (Stinebrickner and 
Stinebrickner, 2002) noted that the first two studies were based on extremely high ability 
students and replicated the analysis using data from Berea College. At Berea, there was 
no evidence that roommate’s ACT scores impacted their roommate’s grades. There was, 
however, evidence that their socioeconomic background did matter. A recent paper using 
data from Vanderbilt University (Gleason and Siegfried, 2003) finds evidence of 
academic peer effects while studies at the University of Maryland (Foster, 2002) and an 
unnamed university (Kremer and Levy, 2003) do not. One possible explanation for these 
differences lies in the possibility of non-linearities in the effects. Dartmouth, Williams 
and Vanderbilt, for example, are extremely selective institutions and have very high 
average SAT scores. At these schools, weaker students showed no evidence of peer 
effects. To put it somewhat differently, the academic performance of weaker students did 
not seem to be sensitive to the peer quality of their roommates. The University of 
Maryland or Berea College, however, are less academically selective and so the sample 
data from those schools may more closely resemble the weaker subsample of students at 



4 




Dartmouth, Williams, or Vanderbilt. This may explain the different findings inasmuch as 
no peer effects were found amongst weaker students at Williams or Vanderbilt. 

There has also been some experimental work done to estimate peer effects. 
Goethals (2001) constructed teams of three undergraduates at Williams College who 
spent forty minutes reading and discussing carefully selected New York Times articles 
then wrote about and discussed what they had learned. The ability level of each student 
was controlled for, and the quantity and quality of the discussions and writings were 
quantified by trained coders. The findings were striking, with groups composed of 
students with homogeneous ability levels, whether strong or weak, outperforming groups 
with mixed ability. 

In this paper, we try to broaden the existing literature in two ways. First, we’ll 
expand the set of individual outcomes and peer characteristics being considered. Second, 
we’ll try to develop a measure of institutional peer effects; a measure of the “ethos” of 
different institutions and the impact it has on students. We use data from the Andrew W. 
Mellon Foundation’s College and Beyond (C&B) survey. First, we utilize roommate 
level data from two of the schools in the survey. These schools were selected because 
they use a quasi-random approach to assigning roommates. This allows us to implement 
a quasi-experimental research strategy whereby we contrast the outcomes of similar 
students who were assigned different peer environments. The richness of the data allows 
us to extend the current literature by considering outcomes other than grades and peer 
environments other than roommate’s SAT scores. Second, we address a somewhat 
broader context for peer effects by asking the following question: how would a student’s 
outcomes have changed if he’d attended a very different school? This is a more difficult 
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question and entails an analysis that provides a way of measuring a schools peer “ethos.” 
Based on that measure, we can group (thirteen) schools in the C&B data that have a 
similar “ethos.” Finally, we can contrast the outcomes of students who have 
characteristics that match well with their school’s ethos with students whose 
characteristics do not. 

The paper is organized as follows. In the next section, we describe the roommate 
based approach to measuring peer effects. Section III provides estimates from this 
approach. Section IV describes the clustering approach to characterizing the institutional 
ethos or peer environment. Section V provides findings associated with this approach. 
Section VI concludes. 

II. Estimating Peer Effects using Roommates Characteristics 

Estimates of SAT -based peer effects generally relate the grades of students in 
their first and later semesters to their own SAT scores and to the SAT scores of their first 
year roommate. More formally, regression models are specified as: 

(1) GPA ic = a+y c + ftSAT, + J3 2 SAT™ + /TV + £•, 

where GPA is the student’s Grade Point Average measured in the first year and also 
cumulatively to graduation, SAT is the student’s own SAT score (sometimes entered 
separately for math and verbal scores and also sometimes entered nonlinearly), SAT RM is 
the student’s roommate’s SAT score (sometimes entered separately for math and verbal 
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scores and also sometimes entered nonlinearly), and X is a vector of other characteristics 
(such as race, gender) of the student. 4 If students are randomly assigned their 
roommate(s), then the estimated peer effect ( J3 2 ) will be unbiased. More generally, the 

estimate will be unbiased if it is plausible thatcov^AT™,^) = 0 . 

To broaden the analysis of roommate based peer effects, we estimate more 
general models of the form: 

o, c =«+r r +AV+A 2 cB“+£ t 

where O can be any outcome of interest and CB can be any characteristic or behavior of 
the roommate that is of interest. In the models that follow, O is always binary. Hence, 
we estimate these models using a probit estimator. 

III. Estimating Peer Effects using Roommates Characteristics: Results 

The C&B data used in this study were created and made available to us by the 
Andrew W. Mellon foundation. The C&B data contain both institutional and survey data 
for over 90,000 students enrolled in thirty-four selective colleges and universities in the 
United States for the entering classes of 1951, 1976, and 1989. This study uses data from 
two of the schools in the C&B population for the entering class of 1989. Institutional 
data in the C&B provide information on the students’ grades, major, race, gender, etc. 
Survey data contain information on a variety of post-graduate values and experiences. 
This survey was conducted in 1995-6. These data were combined with housing 

4 An appealing alternative strategy would be to include the roommate’s GPA in the regression. Such a 
variable might better measure actual rather than potential performance. The problem with including such a 
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information extracted from college phonebooks to form a unique data set that allowed us 
to identify college roommates. The two schools selected were chosen because a) they 
house their first year students together, b) the assignment mechanism of students to 
housing units (as indicated by their housing descriptions on the World Wide Web and 
conversations with their housing offices) seems roughly random, and c) the needed 
variables were available. It was necessary to use schools that choose to group first-year 
students together because the C&B data does not provide information on other classes. 

If, for example, a school allowed first and second year students to live together, we would 
have no information on the second year students. Further, it is necessary for the 
allocation to be approximately random since selection bias becomes possible when 
students are allowed to choose their roommates or if the housing office groups students in 
such a way that under or over performers are more likely to be housed together. In this 
case, the requirement that the error term be uncorrelated with the explanatory variables 
would be violated. In an earlier study of Williams freshmen by one of the authors 
(Zimmerman, 2002), it was possible to utilize data from the housing application forms to 
conduct some relatively simple analyses to check whether the assumption of random 
assignment was plausible. The schools in this sample employed a similar protocol to that 
used by Williams in using housing forms indicating sleep preferences, smoking behavior, 
etc. in assigning students to rooms/roommates - though the underlying housing form data 
was not obtained. 5 

variable is that it is simultaneously determined within the roommate context. Using such a measure would 
introduce simultaneous equation bias. 

5 See a revision of my earlier paper (Zimmerman, 2001) for a mathematical model that illustrates the 
possibility of bias in the estimated peer effects flowing from the use of housing forms in assigning students 
to rooms. It is worth noting that neither Zimmerman (1999) nor Sacerdote (2001) found evidence 
suggesting that protocols used in assigning students to roommates created a bias. The schools used in this 
analysis were selected because they employ similar housing assignment protocols. 
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Data from the Freshman survey were also appended to the C&B and phonebook 
data. Freshman survey data provides information on an array of student characteristics, 
behaviors, and viewpoints. It provides, for example, information on political viewpoints 
and career values expressed by a student prior to attending college. Thus, these variables 
would not, themselves, be the result of college influences. 

The key outcome variables used in this analysis are measures of economic and 
social liberalism, along with a measure capturing students’ emphasis on intellectual 
versus career values. These will constitute the dependent variables in the models that 
follow. 6 We also form measures of students’ political viewpoints and emphasis on 
intellectual or career values using Freshman survey data. These viewpoints were formed 
before the student entered college and are used as explanatory variables (along with 
gender, race/ethnicity, major, SAT scores) in our models. Thus, we ask: holding 
constant individuals’ political persuasions (or intellectual/career focus) when they entered 
college, what was the effect on a student’s political viewpoint (or career/career focus) of 
having a more politically liberal (or more academically inclined) roommate? 

To form the outcome measures of economic and political conservatism, we 
utilized the following question found in the C&B survey: “Thinking about your views 
concerning economic and social issues, where would you place yourself on the scale 
below?” 





Very 




Very 




Conservative 




Liberal 


Economic Issues... 


1 2 


3 


4 


Social Issues... 


1 2 


3 


4 
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Higher scores on this scale indicate greater economic or social liberalism. The survey 
was conducted in 1995-6 - so the answers pertain to students’ post-college political 
position. We estimate probit models where we define students as “Conservative” if they 
coded their response 1 or 2 and as “Liberal” if they coded their response as 4 or 5. 

Prior (pre-college) political viewpoints were formed in two different ways. First, 
we used a variable from the freshman survey where students were asked to characterize 
their political views as “far right,” “conservative,” “middle of the road,” “liberal” or “far 
left.” Alternatively, we used a principal components analysis to form a single “political 
viewpoint” variable from a number of questions in the Freshman Survey. Students, for 
example, were asked whether they disagree or agree (strongly or somewhat) with various 
statements. For instance, they were asked whether they thought the “government [was] 
not controlling pollution,” or there was “too much concern for the rights of criminals” or 
that the government should “prohibit homosexual relations.” The resulting variable was 
continuous ranging from 1 (conservative) to 5 (liberal). The variable used in the analysis 
coded the resulting political sentiments principal components variable as dummy 
variables with scores in the bottom 15% being coded as “conservative”, in the top 15% 
as “liberal”, and “moderate” otherwise. 

To measure career and intellectual values we utilized a measure developed by 
Katchadourian and Boli in their book Cream of the Crop: The Impact of Elite Education 
in the Decade After College (1994), which presents a longitudinal study of the Stanford 
class of 1984. Part of their methodology involves rating students on two scales - 
intellectualism and careerism - and dividing the students into 4 groups. The variables 

6 See Winston and Zimmerman (2003) for estimates on academic peer effects (i.e. GPA) using these data 
and one additional school. 
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used for grouping the students were taken from the Freshman Survey and were based on 
the reasons students gave for attending college and also on the aspects of a job they 
consider important in their long-term career choice. For example, students could indicate 
the importance of various reasons for attending college such as: “to get a better job” or 
“to make more money.” Alternatively, they might emphasize “to learn more about 
things” or “to become a more cultured person.” They might indicate that their career 
choice is heavily influenced by “high anticipated earnings” or “rapid career 
advancement.” These variables were combined into measures of intellectualism and 
careerism using a principal components analysis. The resulting characteristics were 
scored on a 100 point scale. Students were then divided into four groups based on their 
median scores on these two variables. We’ve characterized the groups as follows. 
“Strivers” scored above the median on both intellectualism and careerism; “intellectuals” 
scored above the median in intellectualism but below on careerism; “unconnected” 
students scored below the median on both intellectualism and careerism; and “careerists” 
scored below the median on intellectualism but above on careerism. We created similar 
scales for these groups from the College & Beyond survey responses. We can then use 
these measures to see whether a roommate’s intellectualism, careerism, and other traits 
had an effect on students’ later intellectualism and careerism, etc. 

Table 2 reports the probit parameter estimates (derivatives) for the economic 
political viewpoints models. The first and second column show the estimates using the 
freshman survey measures of political orientation. Here we see that one’s own pre- 
college political views have a strong predictive effect on students’ post-college political 
views. Being “far right”, for example, is associated with a .624 percentage point increase 
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in the odds that an individual is conservative post-graduation compared with a student 
who was on the “far left” of the political spectrum when they entered college. The 
roommate variables have no significant effect when the outcome is “liberal” (see the first 
column in Table 2). Estimates in column two, however, suggest that a student (post- 
college) is (holding their own pre-college views constant) less likely to be politically 
conservative if their roommate is politically moderate or liberal than if they are “far left” 
- the omitted category. Roommate parameter estimates based on the principal 
components definitions of political orientation are found in the third and fourth columns. 
Roommate’s political persuasion is not significant in either of these models. 

Table 3 reports the probit parameter estimates for the social political viewpoints 
models. Again, an individual’s pre-college political orientation has a strong effect on 
their later political persuasion. Roommate effects do not have a significant effect on 
whether an individual considers herself “liberal” post college. They do, however, affect 
whether a person reports being socially “conservative.” Here we see that individuals 
whose roommates are either “middle of the road” or “liberal” are less likely to be 
conservative than individuals whose roommates considered themselves “far left” - the 
omitted category. Roommate parameter estimates based on the principal components 
definitions of political orientation are found in the third and fourth columns. 

Roommate’s political persuasion is not significant in either of these models. 

Table 4 reports the probit parameter estimates for the “Striver,” “Intellectual,” 
“Unconnected,” and “Careerist” models. The dependent variable in each case is a 
dummy variable indicating whether the individual is classified as a member of that 
particular group. Here we see two things. First, the fact that an individual had traits that 
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would define them as a striver, intellectual, or unconnected during the pre-college period 
does not strongly predict whether they would be similarly classified post-college. Thus, 
for example, students giving patterns of answers to Freshman survey questions that 
would suggest a particularly intellectual slant are not statistically significantly more 
likely to give answers to C&B survey questions in 1995-6 that would define them as 
intellectuals. The same holds true for strivers and unconnected students. Pre-college 
careerist sentiments do, however, predict post-college careerist sentiments. In none of 
these models, however, does the roommate’s classification have a significant effect. 

IV. Estimating Peer Effects by Clustering Schools 

We now turn to estimating a broader form of peer effects. Our goal is to answer: 
how would a student’s outcomes have changed if they’d attended a very different school? 
This is a somewhat different question than that addressed in the last section. There, we 
asked whether differences in peer environment at the roommate level had measurable 
effects on students’ political views or their level of intellectualism versus careerism. The 
question we are addressing in this section is more difficult to answer and entails an 
analysis that provides a way of measuring a schools peer “ethos.” We think of this as the 
schools aggregate peer environment. This is certainly a murky concept, but we think 
there is something to it. Most people would agree that Williams is quite different than 
Swarthmore or Oberlin. Yet, these institutions are similar in their selectivity, curriculum, 
and infrastructure. A key feature of the difference would seem to be that the median 
student at these schools are quite different. Some schools are more politically 
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progressive than others. Some have more of a party culture. Others are more intellectual. 
School guidebooks certainly describe schools as having different “ethos.” To quote some 
examples from the popular Fiske Guide to Colleges: 

“Friedrich Nietzsche once wrote, “That which does not kill me makes me 
stronger.” He might well have been talking about Swarthmore College. . .” 

At Duke “people do the work with a smile and have a beer later.” 

“. . .Oberlin remains a bastion of liberal thought and a mecca for the socially and 
politically correct (yes, there are some Dead Heads and granola munchers...)” 

Vanderbilt “as a whole tends to be very conservative and ‘Southern’ in its 
views...” 

“Dennison’s biggest problem may be in overcoming its reputation as a party 
school.” 

Our goal here is to provide a plausible way of grouping schools that are most 
similar in terms of the interests and values of their student body. That is, the groups we 
create should be comprised of students that are similar in some basic ways. We might 
imagine moving students between these schools and having them feel right at home. We 
can then consider students who are “outliers” in the sense that they have interests and 
values that are very different than those of the median student at the school they attend. 
They might be regarded as, for example, “Princeton students” at Oberlin or “Williams 
students” at Swarthmore. Thus, we might contrast the outcomes of students who have 
characteristics that match well with their school’s ethos with students whose 
characteristics do not. That is our basic strategy. 

When attempting to categorize institutions of higher education, studies typically 
rely upon either relatively simple binary measures or preexisting taxonomies in order to 
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classify them. Thus, numerous studies divide institutions along lines of private versus 
public control, religious versus secular nature, and so forth. Research also often relies 
upon systems like the Carnegie classifications that group all institutions according to their 
type, for example, “Doctoral/Research Universities - Intensive” or “Baccalaureate 
Colleges - Liberal Arts”. Studies occasionally use a discrete variable like selectivity, but 
generally use that variable to create several different groups (e.g. “high selectivity”, “low 
selectivity”). These methods were undesirable for our study, however, because although 
these distinctions may have some correlation with or even indirect effect on institutional 
environments, they are nonetheless very different from peer effects and running 
regressions using these divisions would only determine the overall effect of, for example, 
public versus private control. Instead, we used nine discrete variables - from a group of 
thirteen C&B schools - derived from a survey of entering freshmen, all of which describe 

' 7 

the respondent’s interests in some way. By creating clusters of institutions based upon 
the characteristics of students at those institutions, it is possible to classify institutions 
solely from student traits, and therefore, since institutional dummy variables are included 
in the regressions, any differences in outcomes should result only from peer effects and 
not from any other differences between institutions. 

To determine whether students could be said to be “mismatched” with the other 
students at their institution, we first had to derive traits with which they could be 
compared to their classmates. The Astin freshmen survey contained numerous questions 
indicating respondents’ activities during high school, views on various issues, 

7 The schools used were Kenyon College, Stanford University, Wesleyan University, Wellesley College, 
Princeton University, University of North Carolina, Miami University (Ohio), Oberlin College, Bryn Mawr 
College, Pennsylvania State University, University of Pennsylvania, Vanderbilt University, and Williams 
College. These schools had all of the variables needed to conduct the analyses. 
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background experiences, and other relevant factors from which traits could be derived. 
Therefore, particular questions from the survey were taken and used to construct nine 
traits, all scaled equally for a range of 0-1, for each individual student. These included: 

1. Religiosity - A measure of how religious a student was (e.g. “Frequency of 
attending church services”). 

2. Community Service and Political Activism - A measure of how involved with 
community service and political activism a student had been in high school and 
importance placed on them (e.g. “Frequency of performing volunteer work last 
year”). 

3. Athleticism - A measure of how involved with athletics and exercise a student 
had been in high school (e.g. “Hours per week in last year of high school spent on 
exercising and sports”). 

4. Partying - A measure of how involved in partying and related activities a student 
had been in high school (e.g. “Frequency of drinking beer in last year”). 

5. Cultural and Intellectual Interest - A measure of how interested in cultural and 
intellectual pursuits a student was (e.g. “in going to college, how important was 
‘to make me a more cultivated person’”). 

6. Artistic and Literary Interest - A measure of how much importance students 
placed on these interests and their self-ratings on artistic and literary ability (e.g. 
“Importance of creating artistic work”). 

7. Drive - A measure of students’ self-ratings on competitiveness, drive to achieve, 
and leadership ability (e.g. self rating on drive to achieve). 

8. Popularity - A measure of how much time students spent socializing in high 
school and their self-ratings on a variety of popularity-related traits (e.g. self 
rating on popularity). 

9. Politics - A measure of where on a one-dimensional spectrum a student fell 
derived from many questions about their views on specific issues and one 
question about their general political views. A higher score means further to the 
right (e.g. agreement with “abortion should be legalized”). 

A full list of the variables used to create the nine variables is found in Table 5. 
Once these traits had been calculated for each individual student, it became possible to 
determine the overall characteristics of the entering class for each institution by averaging 
the scores for each trait of the students at each institution. Once averages were produced, 
the institutions could be compared on each trait. The thirteen institutions were divided 



16 




